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SEQUENCE LISTING 
GENERAL INFORMATION: 
(i) APPLICANT: Tartaglia, Louis A. 
Weng, Xun 

(ii) TITLE OF INVENTION: NUCLEIC ACID MOLECULES ENCODING 

GLUTEX AND USES THEREOF 
(iii) NUMBER OF SEQUENCES: 10 
(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Fish & Richardson P.C. 

(B) STREET: 225 Franklin Street 

(C) CITY: Boston 

(D) STATE: MA 

(E) COUNTRY: USA 

(F) ZIP: 02110-2804 
(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette 

(B) COMPUTER: IBM Compatible 

(C) OPERATING SYSTEM: Windows95 

(D) SOFTWARE: FastSEQ for Windows Version 2.0 
(vi) CURRENT APPLICATION DATA: . 

(A) APPLICATION NUMBER: US/09/981 , 947A 

(B) FILING DATE: 18-Oct-2001 
(vii) PRIOR APPLICATION DATA: . 

(A) APPLICATION NUMBER: 09/031,392 

(B) FILING DATE: 
(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Meiklejohn, Ph.D., Anita L. 

(B) REGISTRATION NUMBER: 35,283 

(C) REFERENCE/DOCKET NUMBER: 07334/072002 
(ix)- TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: '617/54 2-5070 

(B) TELEFAX: 617/542-8906 

(C) TELEX: 200154 
INFORMATION FOR SEQ ID NO : 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2343 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
CDNA 




(ii) MOLECULE TYPE: 
(ix) FEATURE: 

( A ) NAME/KEY : 

(B) LOCATION: 
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94 240 245 250 

95 GAC AGC CCA CGC TAC CTG CTC TTG GAG AAG CAC AAC GAG GCA AGA GCT 879 

96 Asp Ser Pro Arg Tyr Leu Leu Leu Glu Lys His Asn Glu Ala Arg Ala 

97 255 260 265 

98 GTG AAA GCC TTC CAA ACG TTC TTG GGT AAA GCA GAC GTT TCC CAA GAG 927 

99 Val Lys Ala Phe Gin Thr Phe Leu Gly Lys Ala Asp Val Ser Gin Glu 

100 270 275 280 285 

101 GTA GAG GAG GTC CTG GCT GAG AGC CAC GTG CAG AGG AGC ATC CGC CTG 975 

102 Val Glu Glu Val Leu Ala Glu Ser His Val Gin Arg Ser He Arg Leu 

103 290 295 300 

104 GTG TCC GTG CTG GAG CTG CTG AGA GCT CCC TAC GTC CGC TGG CAG GTG 102 3 
10 5 Val Ser Val Leu Glu Leu Leu Arg Ala Pro Tyr Val Arg Trp Gin Val 

106 305 310 315 

107 GTC ACC GTG ATT GTC ACC ATG GCC TGC TAC CAG CTC TGT GGC CTC AAT 1071 

108 Val Thr Val He Val Thr Met Ala Cys Tyr Gin Leu Cys Gly Leu Asn 

109 320 325 330 

110 GCA ATT TGG TTC TAT ACC AAC AGC ATC TTT GGA AAA GCT GGG ATC CCT 1119 

111 Ala He Trp Phe Tyr Thr Asn Ser He Phe Gly Lys Ala Gly He Pro 

112 335 340 345 

113 CCG GCA AAG ATC CCA TAC GTC ACC TTG AGT ACA GGG GGC ATC GAG ACT 116 7 

114 Pro Ala Lys He Pro Tyr Val Thr Leu Ser Thr Gly Gly He Glu Thr 

115 350 355 360 365 

116 TTG GCT GCC GTC TTC TCT GGT TTG GTC ATT GAG CAC CTG GGA CGG AGA 1215 

117 Leu Ala Ala Val Phe Ser Gly Leu Val He Glu His Leu Gly Arg Arg 

118 370 375 380 

119 CCC CTC CTC ATT GGT GGC TTT GGG CTC ATG GGC CTC TTC TTT GGG ACC 126 3 

120 Pro Leu Leu lie Gly Gly Phe Gly Leu Met Gly Leu Phe Phe Gly Thr 

121 385 390 395 

122 CTC ACC ATC ACG CTG ACC CTG CAG GAC CAC GCC CCC TGG GTC CCC TAC 1311 
12 3 Leu Thr He Thr Leu Thr Leu Gin Asp His Ala Pro Trp Val Pro Tyr 

124 400 405 -410 

12 5 CTG AGT ATC GTG GGC ATT CTG GCC ATC ATC GCC TCT TTC TGC AGT GGG 1359 

126 Leu Ser He Val Gly He Leu Ala He He Ala Ser Phe Cys Ser Gly 

127 415 420 425 

128 CCA GGT GGC ATC CCG TTC ATC TTG ACT GGT GAG TTC TTC CAG CAA TCT 14 07 

129 Pro Gly Gly He Pro Phe He Leu Thr Gly Glu Phe Phe Gin Gin Ser 

130 430 435 440 445 

131 CAG CGG CCG GCT GCC TTC ATC ATT GCA GGC ACC GTC AAC TGG CTC TCC 14 55 

132 Gin Arg Pro Ala Ala Phe He He Ala Gly Thr Val Asn Trp Leu Ser 

133 450 455 460 

134 AAC TTT GCT GTT GGG CTC CTC TTC CCA TTC ATT CAG AAA AGT CTG GAC 1503 

135 Asn Phe Ala Val Gly Leu Leu Phe Pro Phe He Gin Lys Ser Leu Asp 

136 465 470 475 

137 ACC TAC TGT TTC CTA GTC TTT GCT ACA ATT TGT ATC ACA GGT GCT ATC 1551 

138 Thr Tyr Cys Phe Leu Val Phe Ala Thr He Cys He Thr Gly Ala He 

139 480 485 490 

140 TAC CTG TAT TTT GTG CTG CCT GAG ACC AAA AAC AGA ACC TAT GCA GAA 1599 

141 Tyr Leu Tyr Phe Val Leu Pro Glu Thr Lys Asn Arg Thr Tyr Ala Glu 

142 495 500 505. 
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14 3 ATC AGC CAG GCA TTT TCC AAA AGG AAC AAA GCA TAC CCA CCA GAA GAG 164 7 

144 lie Ser Gin Ala Phe Ser Lys Arg Asn Lys Ala Tyr Pro Pro Glu Glu 

145 510 515 520 525 

14 6 AAA ATC GAC TCA GCT GTC ACT GAT GCT CCT GCT TCT TCT CCT TTC ACT 1695 

147 Lys lie Asp Ser Ala Val Thr Asp Ala Pro Ala Ser Ser Pro Phe Thr 

148 530 ' 535 540 

14 9 ACT CCG A AT ACA GCC TGG ATT CAA GCT GCC GCC ACC ACC ACC GCC ACC 174 3 

150 Thr Pro Asn Thr Ala Trp lie Gin Ala Ala Ala Thr Thr Thr Ala Thr 

151 545 550 555 

152 AAA AAA GAA CAC CCA TTG TAAACGGTCA TGTGGTATTT CCTCAACCTG GAATGACC 1799 

153 Lys Lys Glu His Pro Leu 

154 560 

155 TTCCCCTATC TTCTTCTCCT GGAGAACACC AAGTCATGAT GTCAGACAAG AGCTTGGATT 1859 

156 TTGGAGACAT GGGTTTGAAT TCCAGTCATT CATTCTTTTA TTCAGCAAAT ATTTAACAAG 1919 

157 TACTGACATG TCCCATATGT TGTTTTACCC ACTGGTTATA CAATGGGAGG GAGAGAGAGA 197 9 

158 GAGAGAGAGA GAGAGAGATG CTATTCTAAA AGCTTGAAGT CTAGGCTGTG CACGGTGGCT 203 9 

159 CACGCCTGTA ATCCCAGCAC TTTGGGAGGC CGAGGTGGGT GGATCGTGAG GTCAGGAGAT 209 9 
16 0 TGAGACCATC CTGGCTAACA TGGTGAAACT CCCTCTCTAC TAAAAATACA AAAAATTAGC 2159 

161 TGAGCATGGT GGCGGGCGCC TGTAGTCCCA GCTACTTGGG AGGCTGAGGC AGGAGAATGG 2219 

162 CGTGAACCCA GGAGGCGGAG CTTGCAGTGA GCCGAGATCA CACCACCACA CTCCAGCCTG 2279 
16 3 GGTGACAGAG CCAGACTCCG TCTCAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAGGGCGG 2 339 
164 CCGC 2343 
16 6 (2) INFORMATION FOR SEQ ID NO : 2 : 

167 (i) SEQUENCE CHARACTERISTICS: 

168 (A) LENGTH: 563 amino acids 

169 (B) TYPE: amino acid 

170 (D) TOPOLOGY: linear 

171 (ii) MOLECULE TYPE: protein 

172 (v) FRAGMENT TYPE: internal 

173 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

174 Met Ala Arg Lys Gin Asn Arg Asn Ser Lys Glu Leu Gly Leu Val Pro 

175 15 10 15 

176 Leu Thr Asp Asp Thr Ser His Ala Gly Pro Pro Gly Pro Gly Arg Ala 

177 20 25 30 

178 Leu Leu Glu Cys Asp His Leu Arg Ser Gly Val Pro Gly Gly Arg Arg 

179 35 40 45 

180 Arg Lys Asp Trp Ser Cys Ser Leu Leu Val Ala Ser Leu Ala Gly Ala 

181 50 55 60 

182 Phe Gly Ser Ser Phe Leu Tyr Gly Tyr Asn Leu Ser Val Val Asn Ala 

183 65 70 75 80 

184 Pro Thr Pro Tyr lie Lys Ala Phe Tyr Asn Glu Ser Trp Glu Arg Arg 

185 85 90 95 

186 His Gly Arg Pro lie Asp Pro Asp Thr Leu Thr Leu Leu Trp Ser Val 

187 100 105 110 

188 Thr Val Ser lie Phe Ala lie Gly Gly Leu Val Gly Thr Leu lie Val 

189 115 120 125 

190 Lys Met lie Gly Lys Val Leu Gly Arg Lys His Thr Leu Leu Ala Asn 

191 130 135 140 

192 Asn Gly Phe Ala lie Ser Ala Ala Leu Leu Met Ala ' Cys Ser Leu Gin 
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x nr 


Asn 


Ser 


Tl p> 
lie 


Php 

IT IlC 


yjxy 


Lys 


Ala 
c\x a 


rl v 
uiy 


Tl p 
X X c 


Pro 


-L x w 


Ala 


j_iy o 


917 
Z X / 








7 A 0 

J *i VJ 










O ** J 










? SO 
o j \) 






91 Q 

z i o 


Tip 

lie 


rlO 


lyr 


Val 


Th v 
X fix 


Leu 


C o v 

OCX 


X 111 


uiy 


Cl \7 


Tl p 
lie 


VjiX u 


rp h t* 
X I1X 


XjC IX 


Ala 


A 1 a 
nX a 


9 1 Q 
Z 1 :? 






-ice 

ODD 










^fi 0 

O D \J 










J U J 








990 
z z u 


Va 1 
V O, X 


Php 

Jr lie 


Car 
OCX 




T.pi i 
JuC u. 


Va 1 
v a x 


Tip 
X X c 


ri l ii 


n x o 


1-lC Ll 


vj x y 




A TCT 


Pro 


Leu 


Leu 


9 91 

Z Z X 




^70 










375 










380 










9 9 9 
Z Z Z 


Tl p 
11c 


C 1 \7 


H 1 T7 

^jxy 


rile 


C "1 \T 

\jXy 


Leu 


rUc t- 


r^l \7 


Leu 


tr lie 


Php 
ir lie 


fZl \r 
oiy 


Th t* 
X 11X 


T.PI l 
XjC U. 


Thv 
X I1X 


Tl p 

lie 


Z Z o 


~k ft R 










^ QO 




















ADO 

t \j \j 


9 9 A 
z z *± 


Th t* 
X I1X 


Leu 


X ll-L 


Leu 


Vjl 11 


Asp 


nxo 


Ala 
r\x a 


"P >- r^i 

CX \J 


xrp 


Va 1 
v a x 


IrX W 


r P\7T" 

xyr 


Leu 


Ser 


He 


9 9 R 
z z ~> 










AOS 
4 V J 










410 










415 




9 9 £ 
Z Z O 


Va 1 

V a 1 


rl v 
Vjiy 


T 1 p 
11c 


Leu 


Ala 
Ala 


Tl p 
X X c 


Tl p 
X xc 


Ala 
A.X ci 


OCX 


P"h<a 
Ir IlC 


v— y o 


OCX 


Cl \7 

uiy 


Prn 

Jr X w 


\d x y 


fZl \7 

vjx y 


9 9 7 
Z Z f 








A 9 0 
*i Z u 










A 9 R 

*± z J 










A ?0 

*± J VJ 






9 9 ft 


Tip 

x ie 


Px"0 


Jr ne 


Tl o 

x ie 


Leu 


i nr 


vjiy 


r* 1 1 i 

blU 


IT J1C 


Ir IlC 


n n 

VjX 11 


n n 

VjX 11 


Coy* 
OCX 


n n 

VjT X II 


Arg 


Prn 

tr x u 


9 9 Q 
Z Z 






A 7 R 
ft j D 










a a n 

*i *t U 










A A R 
*t *i J 








"1 O A 


Ala 


Ala 


Phe 


lie 


lie 


Ala 


Gly 


Thr 


val 


Asn 


Trp 


Leu 


Ser 


Asn 


Fne 


Ala 


231 




450 










455 










460 










232 


Val 


Gly 


Leu 


Leu 


Phe 


Pro 


Phe 


He 


Gin 


Lys 


Ser 


Leu 


Asp 


Thr 


Tyr 


Cys 


233 


465 










470 










475 










480 


234 


Phe 


Leu 


Val 


Phe 


Ala 


Thr 


He 


Cys 


He 


Thr 


Gly 


Ala 


He 


Tyr 


Leu 


Tyr 


235 










485 










490 










495 




236 


Phe 


Val 


Leu 


Pro 


Glu 


Thr 


Lys 


Asn 


Arg 


Thr 


Tyr 


Ala 


Glu 


He 


Ser 


Gin 


237 








500. 










505 










510 






238 


Ala 


Phe 


Ser 


Lys 


Arg 


Asn 


Lys 


Ala 


Tyr 


Pro 


Pro 


Glu 


Glu 


Lys 


He 


Asp 


239 






515 










520 










525 








240 


Ser 


Ala 


Val 


Thr 


Asp 


Ala 


Pro 


Ala 


Ser 


Ser 


Pro 


Phe 


Thr 


Thr 


Pro 


Asn 


241 




530 










535 










540 
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VERIFICATION SUMMARY DATE: 02/08/2002 

PATENT APPLICATION: US/09/981 , 947A TIME: 10:08:11 

Input? Set : N:\Crf3\RULE60\09981947.raw 
Output Set: N:\CRF3\02082002\I981947A.raw 

L:20 M:220 C: Keyword misspelled or invalid format, [(A) APPLICATION NUMBER : ] 
L:21 M:220 C: Keyword misspelled or invalid format, [(B) FILING DATE:] 

L:598 M:246 W: Invalid value of Alpha Sequence Header Field, [MOLECULE TYPE:], SeqNo=8 

L:608 M:246 W: Invalid value of Alpha Sequence Header Field, [MOLECULE TYPE:], SeqNo=9 

L:619 M:341 W: (46) n n" or "Xaa" used, for SEQ ID#:10 

L:625 M:341.W: (46) "n" or "Xaa" used, for SEQ ID#:10 

L:629 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:10 

L:633 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:10 

L:649 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:10 

L:651 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:10 

L:677 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:10 

L:679 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:10 
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